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Attorney IBKcet No.: 019717-001 1 10US 

LINEAR HALF-RATE PHASE DETECTOR AND CLOCK AND DATA 

RECOVERY CIRCUIT 

CROSS-REFERENCES TO RELATED APPLICATIONS 

This application claims priority from U.S. Provisional Application Number 
60/183,170, filed February 17, 2000, which is incorporated by reference. 

BACKGROUND OF THE INVENTION 

The present invention relates generally to phase-locked loops, and more 
specifically to linear half-rate phase detectors and clock and data recovery circuits. 

Data networking has exploded over the last several years, and has changed the 
way people work, get information, and spend leisure time. Local Area Networks (LANs) in 
the workplace allow for centralized database and file sharing and archiving. Wireless 
Application Protocol (WAP) enabled mobile phones operating over a Wide Area Network 
(WAN) allow users to access news updates and stock quotes. The Internet has transformed 
shopping and research, and has spawned a new recreational activity - Web surfing. Many 
computers are used primarily as interfaces to these networks, thus the expression "the 
network is the computer" has become popularized. 

Devices such as Network Interface Cards (NICs), bridges, routers, switches, 
and hubs move data between users, between users and servers, or between servers. Data 
moves over a variety of media such as fiber optic or twisted pair cables, and the air. These 
media are similar in that they distort data, making it difficult to be read by a receiving device. 
Light-waves in a fiber optic cable travel not only down the cable's core, but bounce off the 
core-cladding interface, and thus tend to disperse. Twisted pair cables have filtering 
properties that tend to attenuate higher frequencies. This limited bandwidth also creates 
interference between individual data bits, known as Inter-Symbol Interference (ISI). 
Wireless signals tend to bounce off buildings and other surfaces in a phenomenon known as 
multipath, which results in the smudging of one data bit into the next. 

Therefore, each of these devices, NICs, bridges, routers, switches, and hubs, 
receive distorted data and must "clean it up", or retime it, for use either by the device itself, a 



device attached to it, or for re-transmission. A useful building block for this is the phase- 
locked loop (PLL). PLLs accept distorted data, and provide a CLOCK signal and retimed (or 
recovered) data as outputs. 

But the task for PLLs has lately begun to be a lot tougher. Equipment 
5 operating at data rates of one Gigabit per second is replacing 100 Megabit devices, which 
recently replaced 10 Megabit units. Exacerbating this problem is the competitive nature of 
the networking business itself. Pricing pressures are enormous, and using high speed, 
specialized processes raises system costs. Therefore, the goal is to create integrated circuits 
that are capable of operating at these data rates, but which can be made using relatively 
10 inexpensive process technologies. What is needed are PLLs which can be made 
^ inexpensively, while still operating at these high frequencies. 

3 SUMMARY OF THE INVENTION 

ry 

;=gl5 Accordingly, the present invention provides a clock and data recovery circuit. 

4 A voltage-controlled oscillator (VCO) operates at half the data rate. A half-rate phase 

Q detector provides two quadrature demultiplexed data outputs, as well as a differential pattern 
; : i independent linear output made up of an error signal and a reference signal. The lower clock 

rate enables the circuit to be manufactured using a less expensive process. Similarly, signals 
h4J0 having higher data rates may be recovered using the same process, as compared to other 

circuits. The linear output generates less supply noise than other architectures. The reduction 

of pattern dependency reduces the pattern dependent offset phase errors that would otherwise 

be present. 

Specifically, one exemplary embodiment of the present invention provides a 
25 method of recovering a clock and data from a data signal. The method includes receiving the 
data signal having a first data rate, receiving a clock signal having a first clock frequency, 
alternating between a first level and a second level, wherein the first data rate is twice the first 
clock frequency. A first signal is generated by passing the data signal when the clock signal 
is at the first level, and storing the data signal when the clock signal is at the second level. A 
30 second signal is generated by passing the data signal when the clock signal is at the second 
level, and storing the data signal when the clock signal is at the first level. A third signal is 
generated by passing the first signal when the clock signal is at the second level, and storing 
the first signal when the clock signal is at the first level. A fourth signal is generated by 
passing the second signal when the clock signal is at the first level, and storing the second 
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signal when the clock signal is at the second level. An error signal is generated by taking the 
exclusive-OR (XOR) of the first signal and the second signal, and a reference signal is 
generated by taking the XOR of the third signal and the fourth signal. 

This embodiment may further include applying the error signal and the 
5 reference signal to a charge pump to generate a charge pump output. 

A further exemplary embodiment of the present invention provides an 
apparatus for recovering data from a received data signal. The apparatus includes a first 
storage device configured to generate a first signal by receiving the received data signal, and 
either passing the received data signal or storing the received data signal, and a second 
10 storage device configured to generate a second signal by receiving the received data signal, 
3 and either passing the received data signal or storing the received data signal. 
j The embodiment further provides a third storage device configured to generate 

H a third signal by receiving the first signal, and either passing the first signal or storing the 
H received first signal, and a fourth storage device configured to generate a fourth signal by 
jl5 receiving the second signal, and either passing the second signal or storing the second signal. 
„ A first logic gate configured to perform an exclusive-OR of the first signal and the second 
U signal; and a second logic gate configured to perform an exclusive-OR of the third signal and 
u the fourth signal are also included. When the first storage device passes the received data, the 
^ second storage device stores the received data, the third storage device stores the first signal, 
20 and the fourth storage device passes the second signal When the first storage device stores 
the received data, the second storage device passes the received data, the third storage device 
passes the first signal, and the fourth storage device stores the second signal. 

Yet a further exemplary embodiment of the present invention provides an 
apparatus for recovering data from a received data signal. The apparatus include a first 
25 storage device having a data input coupled to a first data input port, a clock input coupled to a 
first clock port and a second storage device having a data input coupled to the first data input 
port, a clock input coupled to a second clock port. The apparatus also includes a third storage 
device having a data input coupled to an output of the first storage device, and a clock input 
coupled to the second clock port and a fourth storage device having a data input coupled to an 
30 output of the second storage device, and a clock input coupled to the first clock port. A first 
exclusive-OR gate having a first input coupled to the output of the first storage device and the 
second storage device; and a second exclusive-OR gate having a first input coupled to an 
output of the third storage device and the fourth storage device are also included. The first, 
second, third, and fourth storage devices couple a signal at the data input to the output when a 
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voltage on the clock input is a high, store the signal at the data input when the voltage on the 
clock input is a low. 

A better understanding of the nature and advantages of the present invention 
may be gained with reference to the following detailed description and the accompanying 
5 drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a block diagram of an exemplary optical transceiver that 
10 incorporates one embodiment of the present invention; 

Figure 2 is a block diagram of a clock and data recovery circuit consistent with 
: ;Q one embodiment of the present invention; 

~& Figure 3 is a block diagram of a VCO that may be used in a clock and data 

^ recovery circuit consistent with one embodiment of the present invention; 

i;fl5 Figure 4 is a schematic of one inverter element of the VCO in Figure 3; 

Figure 5 illustrates a block diagram of a half-rate phase detector that may be 
::r? used in a clock and data recovery circuit consistent with one embodiment of the present 
invention; 

:=5 Figure 6 is a schematic of a latch that may be used in the half-rate phase 

;t 20 detector of Figure 5; 

Figure 7A is a truth table, and Figure 7B is a schematic of an exclusive-OR 
gate that may be used in the half-rate phase detector of Figure 5; 

Figure 8 is a charge pump that may used by one embodiment of the present 

invention; 

25 Figure 9 is a generalized timing diagram for a phase detector consistent with 

one embodiment of the present invention; 

Figure 10 illustrates the timing diagram of Figure 9 with a specific data pattern 
and no phase error; 

Figure 11 is the timing diagram of Figure 10 with a phase error introduced; 
30 Figure 12 shows the error and reference voltages as a function of phase error 

for a half-rate phase detector consistent with one embodiment of the present invention; and 

Figure 13 is a flowchart for a method of recovering data and clock signals 
consistent with one embodiment of the present invention. 
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DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

Figure 1 is an exemplary block diagram for an optical transceiver which 
incorporates one embodiment of the present invention. This figure, as with all the included 
5 figures, is for illustrative purposes, and does not limit the possible applications of the present 
invention, or limit the appended claims. This optical transceiver may be on a NIC card with a 
media access controller, some memory, and other circuits. Included is a receive path 
including a photo diode 110, sensing resistor 112, pre-amplifier 120, amplifier 130, DC offset 
correction circuit 150, clock and data recovery circuit 140, and link and data detect 160. A 
10 transmit path having an amplifier 170, Light Emitting Diode (LED) driver 180, multiplexer 
175, oscillator 185, and LED 190 is also shown, 
up A receive fiber optic cable 105 carries an optical data signal to the reversed- 

?7L biased photo diode 110. Photo diode 110 senses the amount of light from fiber optic cable 
[jfj 105, and a proportional leakage current flows from the device cathode to anode. This current 
i;ft5 flows though sense resistor 1 12, thereby generating a voltage. This voltage is amplified by 

pre-amplifier 120, and sent to amplifier 130. DC offsets are reduced by DC correction circuit 
r 3 150. The output of the amplifier 130 drives the clock and data recovery circuits 140, as well 
as the link and data detect block 160. The clock and data recovery circuits extract the 
CLOCK signal embedded in the data provided on line 135 by the amplifier, and uses it to 
W retime the data for output on lines 143. If the link and data detect block 160 senses either a 
data or link signal at the data line 135, a valid link signal is asserted on line 167. If the link 
and data detect block 160 senses a data signal at the data line 135, a receive squelch signal is 
de- asserted on line 163. 

Transmit data is provided on line 173 to amplifier 170 . Amplifier 170 is 
25 enabled by the transmit enable signal on line 177. When amplifier 170 is enabled, transmit 
data is passed to the multiplexer 175. Multiplexer 175 passes the transmit data to the LED 
driver 180 which in turn generates a current through light emitting diode (LED) 190. When 
current is driven through LED 190, light is emitted and transmitted on fiber optic cable 195. 
When the LED driver 180 is not driving current though LED 190, the LED is off, and the 
30 fiber optic cable 195 is dark. If the amplifier 170 is disabled, multiplexer 175 selects the idle 
signal from oscillator block 185. Oscillator block 185 provides an idle signal through the 
multiplexer 175 to the LED driver 180. This idle signal is used by the receiver to ensure that 
a valid optical connection has been made at both ends of the fiber-optic cable 105. 
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As discussed above, the physical media limitations distort the received signal. 
Moreover, the delay through the amplifier 170, multiplexer 175, LED driver 180, and LED 
190 may not be the same for a light-to-dark as for a dark-to-light transition. This mismatch 
causes what is referred to as a duty cycle distortion. Further, electrical noise in the power 
5 supply and data path create jitter and phase noise, which is where the delay through the 
transmitter changes as a function of time. It is the function of clock and data recovery 
circuits, such as block 140, to retime the data so it is in a more useable form for digital 
circuits, and provide a CLOCK synchronized to the data. 

Figure 2 is a block diagram of a clock and data recovery circuit 200, which 
10 may be used as block 140 in figure 1, as well as many other applications. This architecture is 
shown for exemplary purposes, and does not limit either the possible applications of the 
g present invention, or the appended claims. Other architectures will be readily apparent to 
ij* those skilled in the art. For example, the charge pump may be included in the low-pass filter. 

Included in figure 2 is a half-rate phase detector 210, charge pump 220, low-pass filter 230, 
Jjs andVCO240. 

^ ^7^ v ^ A differential data input is provided to the half-rate phase detector 210 oniiata 

FU lines 135. Alternately the data input may be single ended. Demuliplexed data i^pfovided on 
i s ij lines 215 and 217. The half-rate phase detector compares data on hnes^35with the clock 
; s ]f signals on line 145, and outputs an ERROR signal on line 222 Jjjafts proportional to the 
20 phase error between the clock and data signals. The half-fate phase detector 210 provides 

this ERROR signal on line 222, as well as a R^FE^ENCE signal on line 224 to charge pump 
220. The REFERENCE signal on lin^22^is a data dependent signal which is used to correct 
for the data dependence of the^EltROR signal on line 222. Charge pump 220 provides a 
correction signal that is^Itered by low-pass filter 230, and sent to the VCO 240. VCO 240 
25 provides the differential clock signal on line 147 which is used by the half-rate phase detector 
210 for retjnling the data input signal. These blocks form a feedback loop in which a clock 
signals extracted from an incoming data stream, and used to retime the data. 

As its name implies, the voltage controlled oscillator is an oscillator, the 
frequency of which is controlled by a voltage, in this case the voltage Vtune provided by the 
30 low-pass filter 230. As the voltage out of the filter 230 changes, so does the oscillation 
frequency. If the data on line 135 and the clock on line 147 do not have the desired phase 
relationship, for example the data edges are occurring too soon, the half-rate phase detector 
210 outputs an ERROR voltage. This voltage drives the charge pump 220, the output of 
which is filtered and applied to the VCO 240. This voltage has the result of increasing the 
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VCO frequency, such that the clock edges advance. When the desired phase relationship is 
achieved, Vtune changes such that the frequency drops back to the "correct" frequency, and 
this loop is said to be locked. Hence, these clock and data recovery circuits are often referred 
to as phase-locked loops, or PLLs. 
5 The frequency of the clock signal on line 147 is half the data rate of the DATA 

signal on lines 135. The frequency of the half-rate phase detector outputs, DATA1 on line 
215, and DATA2 on line 217, are each half the frequency of the DATA signal on lines 135. 
Also, the ERROR signal on line 222 and the REFERENCE signal on line 224 provide a 
relatively low frequency, essentially differential, correction signal. 
10 These features provide several important benefits. For example, using a 

REFERENCE signal gives context to the ERROR signal, reducing any loop dependency on 
the data pattern on DATA lines 135. If there are no data transitions this loop has no ERROR 

rQ or REFERENCE signal information to use to lock, but since there is no data to recover, this 

\^ special case is of no interest. 

f; J5 Further, the VCO 240 operates at half the frequency as compared to 

conventional architectures. This not only saves power and simplifies the circuit design, but 
also enables using slower, more economical processes to achieve the same system function 
and performance. Similarly, higher performing circuits may be implemented on the same 

i'5 process as compared to other architectures. For example, the data rate may be 10 Gbits/sec, 

j 2fe while the VCO runs at 5 GHz. Using two data outputs each operating at half the data rate 
also saves power. This is because it generally takes more than twice the power to double 
circuit speed in the absence of any design improvements. That is, a point of diminishing 
returns is reached where the application of more power fails to increase circuit speed 
proportionately. Therefore, reducing the switching requirements by half results in a greater 

25 than expected power savings. Architectures consistent with the present invention allow low 
cost processes to be used in demanding applications, for which they would not otherwise be 
suitable. 

Also, conventional systems often employ what is known as a "bang-bang" 
phase detector. In bang-bang detectors, for each data edge, depending on its relation to the 
30 clock, a charge-up or charge-down signal is sent to a charge pump. Such detectors alternate 
between advancing and delaying the clock signal from the VCO, and never reach a stable 
point. Accordingly, bang-bang detectors always have a systematic jitter. Moreover, these 
pulses have fast edges containing high frequency components that couple to the supply 
voltage and inject noise into other circuits. Reducing this noise requires either filtering, or 
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using separate supply lines decoupled from each other. By using a low frequency, effectively 
differential signal out, the linear half-rate phase detector of the present invention does not 
have this systematic jitter, and does not disturb the power supply and other circuits to the 
same extent. 

5 Figure 3 is a block diagram for a VCO 300, which may be used as the VCO 

240 in figure 2. VCO 300 includes three inverter elements in series. This architecture is 
generally known as a ring oscillator. The first inverter element is comprised of inverter 330, 
inverter 320, delay 310, and summing junction 340. The second inverter element includes 
inverter 370, inverter 360, delay 350, and summing junction 375. The third inverter element 
10 is comprised of inverter 390, inverter 385, delay 380, and summing junction 395. Clock 
r3 signals at the output of summing junction 340 are driven through inverter 370, and also 

through delay 350 and inverter 360, to the summing junction 375. Signals Vfine and Vcourse 
l0 on lines 305 and 307 adjust the relative weighting of these two paths. For example, if 
m inverter 360 is off and inverter 370 is on, the clock signal at 340 drives through inverter 370 
;;^5 to summing junction 375 with a minimum delay. If inverter 370 is off and inverter 360 is on, 

the clock signal at summing junction 340 drives through the delay 350 and the inverter 360, 
l ; y and the signal is delayed a maximum amount. Alternately, inverter 370 and inverter 360 may 
\Z each be partially on, such that the signal at summing junction 375 is a composite of signals 
O traveling through inverter 370, and delay 350 and inverter 360. In this case the delay from 
20 summing junction 340 to summing junction 375 is somewhere between the minimum and 
maximum delays. 

Figure 4 is a schematic for one of the three VCO inverter elements shown in 
figure 3. Included are a first inverter stage including Ml 410, M2 420, and current sources 
430 and 440, and second inverter stage including M3 445, M4 445, and current sources 465 
25 and 470. The first inverter and the second inverter outputs share load resistors 490 and 495, 
which correspond to the summing junctions shown in figure 3. Signal Vin at lines 405 and 
415 coupled to the first inverter stage and the delay 450. The delay 450 in turn couples to the 
second inverter. Signals Vfine on lines 475 and Vcourse on line 480, and their compliments 
on lines 476 and 481, adjust the relative weighting of each inverter's contribution to the 
30 output signal Vout at lines 425 and 435. Using separate fine and course current sources allow 
for accurate overall delay control and greater noise immunity. The Vout signal at lines 425 
and 435 couple to the next inverter cell's Vin lines 405 and 415. 

As an example, when Vin is asserted high, that is the voltage on line 405 rises 
above the voltage on line 415, Ml 410 turns on, and conducts current from the current 
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sources 430 and 440. This current flows through load resistor 490, dropping the voltage on 
line 425 in relation to line 435. Similarly, after the delay set by delay block 450, M3 445 
turns on and begins to conduct current from current sources 465 and 470. This current also 
flows through load resistor 490, thus completing Vout's high to low transition. 
5 With regards to the specific example shown in figure 2, a differential Vtune 

voltage from the low-pass filter 230 could be used to drive the Vfine and Vfinebar inputs, 
while the Vcourse and Vcoursebar voltages could be driven by a second loop used for 
achieving frequency lock. Alternately, the same loop could be used for frequency lock, or the 
same loop with some modifications could be used. 
10 Figure 5 is a block diagram for a half-rate phase detector which may be used 

';!? as block 140 in the transceiver of figure 1, as well as other applications. Included are a first 
: J latch 510, a second latch 520, a third latch 560, a fourth latch 570, first XOR gate 540, 
i ; Q second XOR gate 550, and buffers 530 and 580. All signal paths are shown as being 
'^2 differential, but may alternately be single-ended. For example, the DATA may be a single- 
M 5 ended signal on line 505, with line 515 coupled to a bias point, preferably at a voltage 
1=3 approximately equal to the middle of the DATA input voltage swing. In the preferred 
; embodiment shown here, all signal paths are differential, except for the ERROR and 

I'll REFERENCE signal paths, which are single-ended. Using differential signals reduces the 
i!i jitter caused by noise from such sources as the power supply and bias lines. Modifications to 
20 this block diagram will be readily apparent to one skilled in the art. For example, the first 
and second latches may be replaced by a flip-flop. 

Clock signal CLOCKX clocks the first latch 510 and the fourth latch 570. 
Complementary clock signal CLOCKY clocks the second latch 520 and the third latch 560. 
Differential data signal is provided on lines 505 and 515 to the first latch 510 and the third 
25 latch 560. When the CLOCKX line is high, data on lines 505 and 515 pass to the A lines 5 1 1 
and 512. When CLOCKX is low, data on lines 505 and 515 are latched in first latch 510. 
Conversely, when the CLOCKX signal is high, the CLOCKY signal is low, and data on lines 
505 and 515 are latched by the third latch 560. When the CLOCKY signal is high, data on 
lines 505 and 515 pass to the B lines 561 and 562 at the output of the third latch 560. Signals 
30 on the A line 51 1 and 512, and the B lines 561 and 562 are XORed by the first XOR gate 540 
producing an ERROR signal on line 585. 

Signals on the A lines 511 and 512, and the B lines 561 and 562, are latched 
by the second latch 520 and the fourth latch 570. Specifically, when the CLOCKY signal is 
high, the signal on lines 511 and 512 pass through the second latch 520 to lines C 521 and 
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522. But when CLOCKY is low data on lines 511 and 512 are latched by the second latched 
520. Similarly when CLOCKX is high, data on the B lines 561 and 562 pass through the 
fourth latch 570 to the D lines 571 and 572. When CLOCKX is low, data on the B lines 561 
and 562 are latched by the fourth latch 570. Data at the outputs of the second latch 520 and 
5 the outputs of the fourth latch 570 are XORed by the second XOR gate 550 producing a 

REFERENCE signal on line 555. Data at the outputs of the second latch 520, the C lines 521 
and 522, drive the first buffer 530 which outputs the first the demultiplexed data signal on 
lines 535 and 545. The outputs of the fourth latch 570, lines D 571 and 572, drive the second 
buffer 580, which outputs the second demultiplexed data signal on lines 565 and 575. The 
10 ERROR signal on line 585 and REFERENCE signal on line 555 are sent to the charge pump. 
To improve performance, sone circuit delay time and trace paths 



j matched to each other. Specifically, the first latch clock-to-output^ the traces 

* coupling the first latch to the second latch and the XQR-gate540 should match the third latch 
H clock-to-output delay and the tracescoypliiiglhe third latch to the fourth latch and the XOR 
J 5 gate 540. Also, the secopd4atcnclock-to-output delay and the traces coupling the second 
^ latch to the bji£fer530 and the XOR gate 550 should match the fourth latch clock-to-output 

deby^nd the traces coupling the fourth latch to the buffer 580 and the XOR gate 550. 
y Figure 6 is a schematic for an exemplary circuit implementation of a latch 

i used for the first latch 510, and fourth latch 570, in figure 5 by one embodiment of the 
20 present invention. It will be obvious to one skilled in the art that other latches can be used, 
for example a bipolar latch could be used. The second latch 520 and third latch 560 may be 
similar, with the exception that the CLOCKX and CLOCKY terminals are reversed. 
Included are input differential pair devices Ml 610 and M2 620, latching devices M3 630 and 
M4 640, clock input devices M5 670 and M6 680, and current source M7 690. Current for 
25 the latch is generated by M7 690. A voltage VCS is applied to the gate of M7 690 resulting 
in a bias current flowing in its drain. This current is steered through either M5 670, or M6 
680, by the CLOCKX and CLOCKY signals on lines 675 and 685. If the voltage on line 675 
is higher than the voltage on 685, that is CLOCKX is high and CLOCKY is a low, the drain 
current of M7 is steered through M5 670 to the differential pair Ml 610 and M2 620. In this 
30 case, the signals DX on line 615 and DY 625 are passed to the output lines QX 635 and QY 
645. For example, if the signal DX on line 615 is higher than the signal DY on line 625, the 
current from M5 670 flows through Ml 610 across load 650 pulling QY line 645 low. M2 
620 is off, the voltage at QX lines 635 is pulled up to VCC, and is high. Conversely, if the 
signal DX on line 615 is lower than the signal DY on line 625, Ml is off, QY is high, M2 is 
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on, and the current from M5 flows through the load resistor R2 660 and the signal QX 635 is 
low. 

If the signal CLOCKX on line 675 is lower than the voltage of the signal 
CLOCKY on line 685, M5 is off, and the current from the drain of M7 690 passes through 
5 M6 680. If QX on line 635 is high and the signal QY on line 645 is low, M3 630 is on, and 
M4 640 is off. The current from M6 680 flows through M3 630 across load resistor Rl 650 
pulling down and keeping QY on line 645 low. M4 649 is off, whereby the signal QX on line 
635 remains high. In this way the data on lines QX 635 and QY 645 remain latched. Input 
pair devices Ml and M2 are both off, so any signal changes at DX and DY, lines 615 and 625 
10 have no effect on the output signals QX and QY on lines 635 and 645. 
r : 3 Figure 7A is a truth table for an XOR gate. The XOR function is 

Ct deconstructed into the OR of 2 NOR terms in column 702. The XOR gate shown in figure 
;;p! 7B is designed using this equivalent expression. 

i;n Figure 7B is an exemplary XOR gate implemented in accordance with truth 

Q.5 table column 702 of figure 7 A. The alternative expression from figure 7 A allows for 

designing an XOR gate without using stacked devices. This in turn, eliminates the mismatch 
I U between gate delays for different inputs which otherwise result. Included in this XOR gate 
j y are a first NOR gate including Ml 705, M2 710, and M3 715, a second NOR gate including 
; J _ M4 720, M5 725, and M6 730, current sources M9 7 40 and M10 745, output current mirror 
20 M7 750 and M8 755, output load resistor 765, and output offset current source 760. 

Bias voltage VCS is applied to the gates of M9 7 40 and M10 745, thereby 
generating bias currents in their drains. The signals AX on line 707 and BY on line 712 
swing above and below the signal voltage VB on line 717. For example if either AX or BY 
are higher than VB, the drain current of M9 is shunted through Ml 705 or M2 710 to VCC, 
25 line 733. If both the AX signal on line 707 and the BY signal on line 712 are low or below 
the voltage VB on line 717, the current from the drain of M9 passes through M3 715 to M7 
750. Similarly, only if the signals AY on line 727 and BX on line 732 are low or below the 
voltage VB on line 717, does the current from the drain of M10 pass through the device M4 
720 to M7 750. Current in M7 7 50 is mirrored in the drain of M8 755 and applied across 
30 output load resistor 765, generating a voltage at the X output on line 757. Output offset 
current 760 creates a DC voltage at the output for proper biasing to the charge pump. 

Using PMOS devices for M7 750 and M8 755 limits the ERROR and 
REFERENCE signals' bandwidth. The XOR gate smoothes the logical outcome by first 
performing a high-speed logical operation, and then low-pass filtering the output. But this is 



advantageous to the overall system. For example, the power supply noise injected by this 
XOR gate is very limited, since high frequencies are attenuated by the PMOS mirror. Also, 
this gate provides some high frequency filtering, thus reducing the burden on the following 
charge pump and low-pass filter. 
5 5> ^ a * ternate embodiment for an XOR gate can be foundjnjxjjnnrc 

assigned U.S. patent application number , filed , titled "Linear Full- 

Rate Phase Detector and Clock and Data Recoj^eiyt^rciiit," attorney docket number 019717- 
001210US, which is incorporated b^cefgfence. Also, other architectures which may be used 
to implement some ofthgjcirduits herein can be found in commonly assigned U.S. patent 

10 application^jittifiGer , filed , titled "C 3 MOS Logic Family," attorney 

dpeKet] number 019717-0003 10US, which is incorporated herein by reference. 

Figure 8 is a charge pump which may be used as charge pump 220 in figure 2, 
I as well as other applications. Included are a common mode circuit including M7 805, M5 
i 810, M6 815, M8 830, M9 835, and amplifier including Ml 820, M2 825, M3 840, and M4 
1 5 845, and current sources M10 850 and Ml 1 855. The ERROR signal on line 822, and the 
REFERENCE signal on line 827, are provided as differential inputs to the amplifier. Bias 

1 

j voltage VCSP is applied to the gates of M10 850 and Ml 1 855, thereby generating currents 
■ 5 in their drains. If the ERROR voltage on line 822 rises above the REFERENCE voltage on 
3 line 827, the current through Ml increases and the voltage on line 817 increases. 
20 Correspondingly the current and M2 825 decrease, thereby lowering the voltage on line 812. 

If the common mode voltage on lines 817 and 812 is too high, devices M5 810 
and M6 815 shut off The current in M7805 increases, thus increasing the current in M9 835 
which is mirrored in devices M3 840 and M4 845. This increase in current tends to drive 
down the voltages on lines 817 and 812. In this way, Vout's common mode is adjusted to be 
25 centered around the voltage Vref on line 807. The charge pump provides a differential output 
voltage proportional to the difference between the ERROR and REFERENCE signals, and 
provides them at the correct bias point for use by following circuits. 

Figure 9 is a timing diagram for a half-rate phase detector consistent with one 
embodiment of the present invention, such as the circuit illustrated in figure 5. This and the 
30 following timing diagrams are not limited to the circuit shown in figure 5 however, and may 
be generated by other circuitry consistent with the present invention. Shown are input 
waveforms CLOCKX 900 and DATA 910, and resulting waveforms A 920, B 930, ERROR 
940, C 950, D 960, and REFERENCE 970. In a preferred embodiment, CLOCKX 900 is 
approximately a 50 percent duty cycle waveform, but clock signals with other duty cycles, 
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such as 33 or 67 percent may be used. Alternately, other duty cycles may be used consistent 
with the present invention, such as 40 or 60 percent. Waveform A 920 follows the DATA 
signal 910 when CLOCKX 900 is high. When CLOCKX 900 returns low, the signal on 
waveform A 920 is latched, or stored, and does not respond to changes in DATA 910. 
5 Conversely, waveform B 930 follows the DATA signal 910 when CLOCKX 900 is low. 
When CLOCKX 900 returns high, the signal on B 930 is latched, or stored, and does not 
respond to changes in DATA 910. 

The signal A 920 is XORed with the signal B 930, resulting in the ERROR 
waveform 940. For some time following each CLOCKX rising and falling edge, the A 920 
10 and B 930 signal levels are equal. This is because one signal has just latched, while the other 
S had been latched and is now following the DATA signal 910. During this time the ERROR 

2 940 signal level is low. If the signal level at DATA 910 changes, the ERROR signal 940 is 

y 

U asserted high. If the DATA 910 signal level does not change, but rather remains constant, 
;q ERROR 940 remains low. For example if data bit 902 and data bit 904 are both low, then 
^1 5 ERROR signal portion 906 is low. 

;3 Waveform C 950 follows signal A 920 when CLOCKX 900 is low. When 

: J CLOCKX 900 returns high, C 950 is latched, its value stored, and accordingly its value is 
^ retained until CLOCKX 900 returns low. Waveform D 960 follows signal B 930 when 
i CLOCKX 900 is high. When CLOCKX 900 returns low, signal D 960 is latched, its value 
20 stored, and so its value is retained until CLOCKX 900 returns high. Signals C 950 and D 960 
are the demultiplexed data outputs. For example, data bits of DATA signal 910 have been 
sequentially labeled 0, 1,2, and so on. Waveform C 950 comprises the odd bits of DATA 
waveform 910, and waveform D 960 comprises 0 and the even bits of DATA waveform 910. 
Waveforms C 950 and D 960 are XORed, resulting in REFERENCE 970. 
25 ERROR signal 940 is dependent on the phase relationship between DATA 910 

and CLOCKX 900 in the following manner. For example, if data bit 904 is low and data bit 
912 is a high, then ERROR pulse 916 is high. If the DATA signal 910 advances, that is 
shifted to the left, then pulse 916 in the ERROR signal 940 widens (becomes longer in 
duration). If the DATA signal 910 is delayed, that is shifted to the right, then pulse 916 of 
30 ERROR signal 940 narrows (becomes shorter in duration). But note as above, if data pulse 
904 and data pulse 912 are equal, then data pulse 916 is low. Therefore, the average voltage 
of ERROR waveform 940 is dependent not only on the phase error between CLOCKX 900 
and DATA 910, but on the data pattern of DATA 910. For this reason, the ERROR signal 
940 is most meaningful in the context of REFERENCE signal 970. 
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If we assume random data, that is the probability of each in data bit and 902 
being higher low is equal, then half of all ERROR pulse those 906 are high and half are low. 
If the CLOCKX 900 and DATA 910 signals are in quadrature, that is they are at right angles 
or ninety degrees shifted apart, then for half the time between clock edges the ERROR signal 
5 940 is low, and half the time it is an ERROR pulse that may be low or high. Accordingly, for 
random data, when phase lock is achieved, the average signal level of ERROR signal 940 is 
one-fourth its peak value. 

The average value of REFERENCE signal 970 is also data dependent. For 
example, if data bit 902 and 904 are both low then REFERENCE bit 918 is low. But if data 
10 bit 904 and data bit 912 are not equal, REFERENCE bit 928 is high. For random data the 
probability of two consecutive bits being equal is the same as the probability of two 
consecutive bits being unequal. Thus, half the REFERENCE bits 918 are low, and half are 
^ high. Therefore, the average value of the REFERENCE signal 970 is half its peak value. 

^ If the data is not random, for instance if DATA 910 is a long string of < 
jl 5 high or low data bits, then ERROR pulses, such as 906, and REFERENCE pulsej^stfch as 

917 are low. The of the ERROR signal's average value is at a minimum^^s'is the 
0 REFERENCE signal 970. But if the data changes every bit, then e^etffERROR signal pulse 
H and each REFERENCE bit is high. Therefore the ERROR sigrfal is equal to half its peak 
=3 value and the REFERENCE signal equals its peak valxi^Thus, the ERROR signal and 
20 REFERENCE signal divided by two have the sam^tfata pattern dependency, while the 
ERROR signal also tracks the phase error. This means the data dependency of ERROR 
signal 940 can be corrected by subtracting'half the average value the REFERENCE signal 
970. From a circuitry implementatipfC this means in figure 7, PMOS mirror devices M7 750 
and M8 755 should be scaled differently for XOR gates 540 and 550 in figure 5. 
25 Specifically, either M8 caj*1>e doubled, or M7 can be halved in XOR gate 540 as compared to 
XOR gate 550. The difference signal between the ERROR and one-half the REFERENCE 
signals is not dependent on the data pattern, but is dependent on the phase error. This 
resulting sigrtjiu has approximately a zero value when the DATA signal's edges are aligned 
with the^enter between the CLOCK edges. As the DATA is delayed, the differential value 
30 becpfnes negative. As the DATA advances, the difference becomes positive. 

This pattern dependency reduction of the half-rate phase detector output 
reduces the pattern dependent phase error that would otherwise occur, though there may be 
random pattern dependent jitter that would remain unaffected. 
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Each data bit has a duration ti 943. The reciprocal of the data bit duration ti 
943 is referred to as the data rate. Each clock period has a duration t 2 947, where t 2 is equal 
to twice ti. The clock frequency, or clock rate, is the reciprocal of the duration tj 947. 
Therefore, the clock frequency is half the data rate. It is interesting to note that the data and 
5 clock signals' switching frequency, that is the reciprocal of the duration between rising and 
falling edges, is the same. In conventional systems, the clock's switching rate is twice what 
is shown in figure 9. Accordingly, the VCO's bandwidth and related clock path for circuitry 
implementing figure 9 is half that of conventional systems. This provides a savings in power, 
and eases the complexity and risk of the circuit design. 
10 Figure 10 is a timing diagram of the various waveforms for a half-rate phase 

l: !f detector used in one embodiment of the present invention. Included are input waveforms 
H CLOCKX 1000 and DATA 1010, and resulting waveforms A 1020, B 1030, ERROR 1040, C 
nj 1050, D 1060, and REFERENCE 1070. This timing diagram is for a specific DATA 1010 
■^l input pattern. Each transition in DATA 1010, such as 1002 and 1004, results in pulses in 
% ERROR signal 1040, specifically 1006 and 1008, and high REFERENCE bits 1012 and 1014. 

Figure 1 1 is a timing diagram of the various waveforms for a half-rate phase 
\ ^ detector used in one embodiment of the present invention. Included are input waveforms 
i : U CLOCKX 1 100 and DATA 1 1 10, and resulting waveforms A 1 120 B 1 130, ERROR 1 140, C 
It 1 1 50, D 1 1 60, and REFERENCE 1 1 70. DATA waveform 1 1 1 0 is the same as DATA 
20 waveform 1010 in figure 10. In this specific example, DATA waveform 1110 has been 

delayed relative to CLOCKX waveform 1 100. Again, each transition in DATA waveform 
1110, such as 1 102 and 1 104, results in pulses in ERROR waveform 1 140, specifically 1 106 
and 1 108, and high REFERENCE bits 1 1 12 and 1 1 14. But this time, since the DATA 
waveform 1110 has been delayed, ERROR pulses 1 106 and 1 108 are narrower than the 
25 corresponding pulses 1006 and 1008 in figure 10. Accordingly, the average value of ERROR 
signal 1 140 is lower than the average value of ERROR signal 1040 in figure 10. 
REFERENCE bits 1 1 12 and 1 1 14, however, are the same as REFERENCE bits 1012 and 
1014 in figure 10. Therefore, the same DATA waveform 1 1 10 in figure 11, and 1010 in 
figure 10, results in a lower ERROR value. But the same REFERENCE signal, shown as 
30 1 170 in figure 1 1 and 1070 in figure 10, is achieved, so the REFERENCE signal is 
independent of the phase error, but it is dependent on the data pattern. 

Figure 12 graphs the ERROR voltage and REFERENCE voltage outputs for a 
half-rate phase detector consistent with one embodiment of the present invention. The 
voltages of ERROR signal 1210 and REFERENCE signal 1220 are graphed as a function of 
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the phase error between the data and clock signals. ERROR signal 1210 is proportional to 
the phase error. ERROR signal 1210 may be linear. Alternately, ERROR signal may have 
nonlinear characteristics. REFERENCE signal 1220 is approximately independent of the 
phase error, but is a function of the data pattern. REFERENCE signal 1220 may become 
5 discontinuous or notched when the phase error is near plus or minus 180 degrees. 

Figure 1 3 is a flow chart for a method of recovering data and clock signals 
from a data stream consistent with one embodiment of the present invention. In act 1310, a 
data input signal, a clock signal, and the clock signal complement are provided. The data 
input is applied to a first latch clocked by the first clock signal in act 1320. The data is 
10 applied to a second latch clocked by the complementary clock signal in act 1330. In act 1340 
t the first latch's output is applied to a first XOR gate and a third latch. The second latch's 
4 output is applied to the first XOR gate and a fourth latch in act 1350. In act 1360, the third 
n latch's output and the fourth latch's output are applied to a second XOR gate. The first XOR 
l 2 gate's output is used as an error signal, the second XOR gate's output is used as a reference 
45 signal, the third latch's output is used as a first data output, and the fourth latch's output is 
3 used as a second data output in act 1370. 

~ In act 1380 the error signal is subtracted from half the reference signal, and 

U filtered. The filter output is used to adjust the clock signal and its complement in act 1390. 
7 Embodiments of the present invention have been explained with reference to 

20 particular examples and figures. Other embodiments will be apparent to those of ordinary 

skill in the art. Therefore, it is not intended that this invention be limited except as indicated 

by the claims. 
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